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Abstract 



Phylogenetic networks are a generalization of phylogenetic trees that allow for the 
representation of non-treelike evolutionary events, like recombination, hybridization, or 
lateral gene transfer. In a recent series of papers devoted to the study of reconstructibil- 
ity of phylogenetic networks, Moret, Nakhleh, Warnow and collaborators introduced the 
so-called tripartition metric for phylogenetic networks. In this paper we show that, in 
fact, this tripartition metric does not satisfy the separation axiom of distances (zero 
distance means isomorphism, or, in a more relaxed version, zero distance means indis- 
tinguishability in some specific sense) in any of the subclasses of phylogenetic networks 
where it is claimed to do so. We also present a subclass of phylogenetic networks whose 
members can be singled out by means of their sets of tripartitions (or even clusters) , 
and hence where the latter can be used to define a meaningful metric. 

Keywords. Phylogenetic networks, recombination, bipartitions, tripartitions, 
tripartition metric, error metric 

1 Introduction 

Phylogenetic trees have been used since the days of Darwin ^ to represent evolutionary 
histories of sets of species under mutation. Their popularity and prevalence have led to the 
introduction of many methods to their reconstruction, combination, and comparison [HllHl 
[T9] . But, as Doolittle pointed out about a decade ago [7], the history of life cannot be 
properly represented as a tree. Phylogenetic networks are used then as a generalization of 

*This work has been partially supported by the Spanish CICYT project TIN2004-07925-C03-01 GRAM- 
MARS, by Spanish DGI projects MTM2006-07773 COMGRIO and MTM2006-15038-C02-01, and by EU 
project INTAS IT 04-77-7178. 
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phylogenetic trees that allow for the representation of non-treelike evolutionary events, like 
recombination, hybridization, or lateral gene transfer [1]. 

The natural model for describing an evolutionary history is a directed acyclic graph 
(DAG for short) representing the parent-child relation. Phylogenetic trees are rooted DAGs 
where each node other than the root (which represents the common ancestor of all indi- 
viduals under consideration, be them species or biomolecular sequences) has at most one 
parent, from which it has been derived through mutation. Phylogenetic networks are rooted 
DAGs containing tree nodes, which have only one parent and thus correspond to regular 
speciation events, and hybrid nodes, which have more than one parent and thus correspond 
to hybrid speciation events. To such a DAG several extra conditions have been added in 
the literature to provide a realistic model of recombination |20y 21j or simply to narrow the 
output space of reconstruction algorithms [HITO] . 

In a series of papers [TTl[T2l[13l[lll[T5l[16l[T7] devoted to the study of reconstructibility 
of phylogenetic networks, Moret, Nakhleh, Warnow and collaborators introduced a method 
to compare a reconstructed network and the true phylogeny, the so-called tripartition, or 
also error, metric. This method is based on the association, to each node v of the network, 
of a tripartition of its set of leaves into those that are strict descendants of v (that is, such 
that every path from the root to the leaf contains v), those that are non-strict descendants 
of it (that is, that are descendants but not strict descendants), and those that are not 
descendants of v. These tripartitions may be enriched with some extra information like, 
for instance, the greatest number of hybrid nodes in a path from v to each leaf, or the 
sets of descendants of the parents of hybrid nodes. Notice anyway that these tripartitions 
are a natural generalization to non-tree networks of Bourque-Robinson-Foulds bipartitions, 
which associate to each node of a phylogenetic tree the partition of its leaves into descendant 
and non-descendant ones. These bipartitions are used to define one of the most popular 
distances for phylogenetic trees [3tlT8]. 

One of the key points in the definition of this tripartition metric is the claim that the 
sets of tripartitions discriminate, up to isomorphism, phylogenetic networks in a suitable 
subclass of them. That is, that if two phylogenetic networks and N' in this subclass have 
the same sets of tripartitions, then they are isomorphic. This turns out to be equivalent 
to the separation axiom for the tripartition metric (zero distance means isomorphism). In 
this paper, we provide counterexamples showing that all claims made in this connection in 
those papers are untrue, and thus that the tripartition metric does not satisfy the separa- 
tion axiom in any of the cases considered by the authors, even in the more relaxed sense 
of [11], where zero distance is simply claimed to be equivalent to equality modulo a certain 
specific notion of indistinguishability. Therefore, the tripartition metric cannot be used in a 
meaningful way to compare phylogenetic networks in the classes considered in the aforemen- 
tioned papers, as it cannot decide the equality (or even indistinguishability) of networks. 
Then, in the last section, we show a slight variant of the class considered in one of these 
papers where tripartitions, and even bipartitions in the sense of Bourque-Robinson-Foulds, 
do define a metric. 

2 Notations on DAGs 

Let A^ = {V,E) be a directed acyclic graph (DAG). We denote by di{u) and do{u) the 
in-degree and out-degree, respectively, of a node u G V. 

A node v & V is a leaf if do{v) = 0. A node v V is a tree node if di{v) ^ 1. Such a 
tree node is a root if di{v) = 0, and internal if di{v) = 1 and do{v) > 0. A node v & V is 
hybrid if di{v) > 1. We denote by Vl, Vt, and Vh the sets of leaves, of tree nodes, and of 
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hybrid nodes of N, respectively. An arc {u, v) & E is a tree arc if its head is a tree node, 
and a network arc if v is hybrid. 

A clade of a DAG iV is a subtree of N with set of nodes contained in Vr and set of 
leaves contained in Vl- 

A node v £ V is a child of u G F if (u, v) € E; we also say that u is a parent of v. All 
children of the same node are said to be siblings of each other. The tree children of a node 
u arc its children that are tree nodes. 

Let S be any finite set of labels. We say that the DAG is labeled in S when its leaves 
are bijectively labeled by elements of S. Two DAGs N,N' labeled in S are isomorphic, 
in symbols N = N', when they are isomorphic as directed graphs and the isomorphism 
preserves the leaves' labels. 

We shall always assume, usually without any further notice, that the DAGs appearing 
in this paper are labeled in some set S, and we shall always identify, usually without any 
further notice either, each leaf of a DAG with its label in S. 

A path on a DAG = {V, E) is a sequence of nodes {vq,vi, . . . , v/.) such that (wj-i, Wj) G 
for alH = 1, . . . , fc; such a path is a cycle if Vk = vq. We call vq the origin of the path, 
vi, . . . , its intermediate nodes, and its end. The length of the path {vq, vi, . . . , v^) 
is k, and it is non-trivial ii k ^ 1. We denote hy u^v any path with origin u and end v. 

The relation ^ on F defined by 

u ^ V there exists a path u-^v 

is a partial order, called the path ordering on N. Whenever u ^ we shall say that v is a 
descendant of u and also that u is an ancestor of v. 

3 Tripartitions 

Let N = {V, E) be any DAG labeled in S. For every node u eV: 

• Let C{u) C 5 be the set of leaves that are descendants of u. We call C{u) the cluster 
of u. 

• Let A{u) C C{u) be the set of leaves that are strict descendants of u: that is, those 
leaves s such that every path from a root of A?^ to s contains the node u. We call A{u) 
the strict cluster of u. 

• Let B{u) C C{u) be the set C{u) \ A{u) of leaves that are non-strict descendants of 
u: those leaves s that are descendants of u, but for which there exists some path from 
a root to s not containing the node u. 

• Let C"^(u) C 5 be the set Vl \ C{u) of leaves that are not descendants of u. 

A phylogenetic tree on a set S of taxa is a rooted tree with its leaves labeled bijectively 
in S, i.e., a rooted DAG labeled in S without hybrid nodes. Notice that, in a phylogenetic 
tree, C(u) = A(u) and B{u) = for every node u. This property actually characterizes 
phylogenetic trees among all rooted DAGs. 

Every arc e = {u, v) of a phylogenetic tree T = {V, E) on S defines a bipartition of S 

7r(^)(e) = (C(«),C»). 

Let 7r(r) denote the set of all these bipartitions: 

7r(r) = {7r(^)(e) \ eeE}. 
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The bipartition distance [3l[T8] between two phylogenetic trees T and T' on the same 
set S of taxa is defined as 

dAT,T') = i(|^(T) \^(T')| + k(T') \vr(r)|). 

The bipartition distance is a true distance for phylogenetic trees, in the sense that it 
satisfies the axioms of distances up to isomorphisms: for every phylogenetic trees T, T', T" 
on the same set S of taxa, 

(a) Non-negativity: dT^{T,T') ^ 

(b) Separation: d-,^{T, T') = if and only if T = T' 

(c) Symmetry: dT^{T,T') = d.^(T' ,T) 

(d) Triangle inequality: d-,^{T,T') ^ dj^iTjT") + dTj{T" ,T') 

Phylogenetic networks [Ij are usually defined as a subclass of DAGs that extend phylo- 
genetic trees by allowing the existence of hybrid nodes representing recombination or lateral 
gene transfer events. In a non-tree DAG it still makes sense to consider bipartitions vr 
associated to arcs e = {u,v), 

7:(^\e) = {C{v),C%v)), 

and then to define 

Tr{N) = {7r(^)(e) \ e £ E}. 

and 

d^{N,N') = ^{\7riN)\n{N')\ + |vr(Ar') \ vr(Ar)|) . 

As we shall see in Section [9l there even exist subclasses of non-tree DAGs where d-j^ is a 
distance. 

But, to distinguish between strict and non-strict descendants gives more information 
about the topological relations between the hybrid nodes. This is the reason why, in 
the series of papers [IIl[l2l[l3l[lll[l5l[l6l[I7], Moret, Nakhleh, Warnow, and collaborators 
associate to each arc e = {u, v) of a DAG N labeled in S, the tripartition of S 

e^''\e) = {A{v),B{v),C'{v)). 

As an extra piece of information about the topology of the DAG, the leaves s in A{v) and 
B{v) can be weighted with the maximum number of hybrid nodes contained in a path from 
t; to s (including v and s themselves). Therefore, we can distinguish between: 

• The {unweighted) tripartition 6^^\e) |16 y i5 [ [T^. 

• The B-weighted tripartition 9^^\e), where the elements of B[v) are weighted as 
indicated [13j. 

• The AB-weighted tripartition 6^^^{e), where the elements of both A{v) and B{v) are 
weighted as indicated [m[T2] . 

For every type of tripartition T = 6, 6b, Oab, let us denote by T{N) the set of all these 
tripartitions of arcs of N: 

T{N) = {T(^)(e) \ eeE}. 
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The tripartition distance relative to T between two DAGs A'^i = (Vi, Ei) and = iV2, E2) 
labeled in the same set S can be defined then, by analogy with the bipartition distance, by 



dr{Ni,N2) = ^(|T(Afi)\T(iV2)| + |T(iV2)\T(iVi)|) 



It is obvious that dy always satisfies the non-negativity, symmetry and triangle inequal- 
ity axioms of distances on the class of all DAGs labeled in a fixed set S. As far as the 
separation axiom goes, notice that d-^iNi, N2) = if and only if T(A^i) = T(A'^2)- There- 
fore, dj- satisfies the separation axiom on a certain subclass of DAGs if and only if T{N) 
characterizes up to isomorphism among all DAGs in this subclass. When these equiv- 
alent conditions hold, we shall say that the tripartition T satisfies the separation property 
on the subclass of DAGs under consideration. 

Notice that, for every DAGs N and N' , 



Then, the separation property for 9 implies the separation property for 9b, and the latter 
implies the separation property for 9ab- 

In the papers by Moret, Nakhleh, Warnow, et al mentioned above, it is claimed that 
some of these tripartitions T (and even further refinements of them) satisfy the separation 
property on some specific subclasses of DAGs. In the following sections we show that 
all these claims are incorrect, and then in Section [9] we show a subclass of phylogenetic 
networks where 9, an even the bipartition vr, satisfy the separation property. 

To end this section, we want to mention that in the aforementioned papers, the T- 
tripartition distance is actually not defined as above, but in a normalized version, which 
the authors call error metric: 



This is the function claimed to be a distance on some subclasses of DAGs in those papers. 
It is straightforward to notice that mx satisfies the separation axiom of distances on a 
subclass of DAGs if and only if dx does so, and therefore the counterexamples in the next 
sections also entail that mx neither satisfies the separation axiom when claimed. 

But, contrary to what happens with dx and against what is claimed in several of the 
papers under review (see, for instance, the proof of [14^ Thm. 3]), this normalized version mx 
need not satisfy the triangle inequality either, even on the subclasses of DAGs considered 
in those papers. For one reason, the failure of the separation property allows the existence 



of DAGs N = {V,E) and N' = {V',E') such that T(iV) = T(A^') but, say, \E\ < \E' 
Then, given any other DAG Nq = {Vq^Eq), 



9ab{N) = 9ab{N') 



9b{N) = 9b{N') 



9{N) = 9{N'). 






l/|T(iVo)\T(iV)| , [T(iV)\T(iVo)| 
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and then \E\ < \E'\ implies that 

(M /\n l/ |T(iVo)\T(jV)| , |T(jV)\T(iVo)[ 

1 / |T(jVo)\T(jV)| |T(jV)\T(iVo)K 
2V \Eo\ \E'\ J 

= mr{No,N') +mr{N',N). 

For two specific N = {V, E) and N' = iV',E') such that T(iV) = T(iV') but \E\ 7^ \E'\, 
precisely in the context of Thm. 3 in [T3], see the DAGs A'^g and Niq depicted in Fig. [S]in the 
Appendiji0. They are reduced reconstructible phylogenetic networks (see Section [8] for the 
precise meaning of these words), the subclass where that theorem claims that rriQ satisfies 
the triangle inequality, and they have the same AB-weighted tripartitions (see Table [7] also 
in the Appendix) but different numbers of arcs. This shows that mg^g does not satisfy the 
triangle inequality on the subclass of reduced reconstructible phylogenetic networks. 

Anyway, the failure of the triangle inequality is easily solved for instance using 
instead of mr- The failure of the separation axiom is deeper, as it reflects the impossibility 
of discriminating the phylogenetic networks under consideration using only information on 
their tripartitions. 



4 The first error metric 



The error metric for phylogenetic networks is introduced by Moret, Nakhleh, and Warnow 
in the Technical Report [13]. In it, a phylogenetic network on a set S of taxa is deflned as 
a rooted DAG N labeled in S satisfying the following conditions: 

(4.1) The in-degree and out-degree of each node is 0, 1, or 2, and no node has its in-degree 
equal to its out-degree. 

(4.2) If a node has two children, at least one of them is a tree node. 

(4.3) Weak time consistency: 

(4. 3. a) If ui and U2 are the parents of a hybrid node u, then there do not exist paths 

Ul-^U2 or U2'^Ui. 

(4.3.b) If ui and U2 are the parents of a hybrid node u, and vi and V2 are the parents of 
a hybrid node v, and there exists a path ui-^vi, then there do not exist paths 

V2'^Ui or V2'^U2- 

Notice that in these phylogenetic networks, a hybrid node can have a hybrid child. This 
would correspond to a hybrid node that hybridizes before undergoing a speciation event, a 
scenario that, the authors say, "almost never arises in reality." 

Condition (4.3) is a flrst, weak version of a constant property of Nakhleh- Warnow-et al's 
phylogenetic networks: time consistency. Roughly speaking, this property aims at assigning 
times to the nodes of the network in a way that strictly increases on tree arcs and so that 
the parents of a hybrid node coexist in time; we shall discuss this topic further in the next 
section. In the weak version recalled in this section, and which does not entail in general 
such a timing (cf. Prop. [T] in the next section), this restriction is simply imposed by not 

^To make easier reading this paper, we gather aU large figures depicting phylogenetic networks, as well 
as the tables giving their tripartitions, in an Appendix at the end of the paper. 
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allowing a node to hybridize with its descendants, and by forbidding the ancestors of a 
parent of a hybrid node to hybridize with the descendants of the other parent. 

A phylogenetic network is said to be of class I when each hybrid node has at least one 
parent that is a tree node. 

Then, it is claimed in [I3i. Thm. 4] that 6b satisfies the separation property on the 
subclass of all class I phylogenetic networks. This claim is untrue, because there exist pairs 
of non-isomorphic class I phylogenetic networks with the same sets of S-weighted (even 
AS-weighted) tripartitions. 

Consider for instance the phylogenetic networks A'^i and A''2 labeled in {!,. . . ,5} depicted 
in Fig. [3] in the Appendix. It is easy to check that these two DAGs satisfy conditions 
(4.1) to (4.3) above and that they are of class I. Now, Table [T] displays the Ai3-weighted 
tripartitions of these networks induced by their arcs. A simple inspection of this table 
shows that Oab{Ni) = 6ABiN2). But it is clear that Ni ^ N2. 

It is interesting to point out that 6b, and even 6, satisfy the separation property if we 
moreover forbid the "improbable" event of two consecutive hybridizations: i.e., if we impose 
not only that some child of a tree node is a tree node (condition (4.2)), but also that the 
only child of an internal hybrid node is a tree node, a condition that would be imposed 
in later versions (see conditions (5.2), (6.2), and (8.2)). We devote Section [U] to prove this 
fact. 

5 The second version: introducing strong time consistency 

A new version of the tripartition metric is presented in the Technical Report [llj . This is 
the metric used, for instance, in the paper [T7]. The main difference between the proposal 
in this new Technical Report and the previous one [13] is the refinement of the notion of 
phylogenetic network, by distinguishing between model and reconstructihle phylogenetic 
networks and strengthening the time compatibility and class I conditions. 

In [11] , a model phylogenetic network on a set S of taxa is defined as a rooted DAG N 
labeled in S satisfying the following conditions: 

(5.1) The root and all internal tree nodes have out-degree 2. All hybrid nodes have out- 
degree 1, and they can only have in-degree 2 (allo-polyploid hybrid nodes) or 1 (auto- 



(5.2) The child of a hybrid node is always a tree node. 

(5.3) Strong time consistency: Let x, y be any two nodes for which there exists a sequence 
of nodes {vq,vi, . . . ,Vk) with vq = x and = y such that: 

• for every i = 0, . . . , /c — 1, either (wj, Vj+i) is an arc of A^, or (wj+i , Vi) is a network 
arc of N; 

• at least one pair (fj,fi+i) is a tree arc of N; 

(that is, (fo, . . . , f fc) is a path from x to y containing some tree arc of N , in the graph 
N* obtained from N by adding the inverses of all network arcs). Then, x and y 
cannot have a hybrid child in common. 

^In our classification of nodes in DAGs in Section [21 these auto-polyploid hybrid nodes, with in-degree 
and out-degree equal to 1, would actually be considered tree nodes. 



polyploid hybrid nO' 
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This new notion of time consistency generalizes the one given in the previous version, 
and, as we shall see in a minute, it captures the notion of timing mentioned therein. This 
timing is given by a temporal representation in the sense of Baroni, Semple and Steel [2]: 
a mapping r : y ^ N such that: 

(a) if r is the root of N, then r(r) = 0; 

(b) if {u,v) £ Et, then r(u) < t{v); 

(c) if {u,v) € En, then t{u) = t{v). 

Baroni, Semple and Steel prove in loc. cit. the equivalence between the existence of such 
a temporal representation and the fact that a certain quotient graph (essentially obtained 
by identifying hybrid nodes with their parents) of the network is acyclic. Since none of the 
papers on tripartitions we are discussing provides a formal proof of the fact that condition 
(5.3) above is equivalent to the existence of a temporal representation, and for the sake of 
completeness, we provide such a proof here. 

Proposition 1. Let N = {V,E) be a rooted DAG, let Et and En he its sets of tree and 
network arcs, respectively, and let N* = {V,E*) be the directed graph with the same set V 
of nodes as N and set of arcs E* = E U E^^ . The following conditions are equivalent: 

(i) N is strongly time consistent. 

(a) N* does not have any cycle containing some tree arc of N. 
(Hi) N admits a temporal representation. 

Proof, (i)^^(ii) To begin with, notice that if A^* has cycles containing tree arcs of A'^, then 
it has a minimaH such cycle. Indeed, if A^* has cycles containing tree arcs, then it will 
contain one of shortest length, say 

{vo,Vl,...,Vk,Vo). 

If it is not minimal, then Vi = vj for some ^ i < j ^ k (actually j — i ^ 2, because A'^ 
does not contain loops), and hence we have two strictly shorter cycles in A^*, 

{vi,Vi+i, . . .,Vj^i,Vj = Vi) and ivo,vi, . . .,Vi-i,Vi = Vj,Vj+i, . . .,Vk,vo), 

and at least one of them contains the tree arc that belonged to the original cycle, which 
leads to a contradiction. 

Assume now that A^ satisfies the strong time consistency condition and that A^* has a 
minimal cycle 

{vo,Vi, . . .,Vk,Vo) 

containing some tree arc of A^: without any loss of generality, we shall assume that (ffc, fo) 
is a tree arc of A^, and in particular that vq is a tree node. 

Since A^ is acyclic, this cycle must contain some arc in E^^ . Let i G {1, . . . ,k — 1} be 
the first index such that {vi,Vi+i) € E'j^^: in particular, Uj+i is one of the parents in A^ 
of the hybrid node Vi (notice that i 0, because vq is a tree node of A^, and that i ^ k 
because {vk,vo) is a tree arc of A^). Then the arc (f j-i, Vi) is a network arc of A^, and since 
the considered cycle is minimal, Vi-i must be different from Wj+i. 

^By a minimal cycle {vo,vi, . . . ,Vk,vo) we mean a cycle such that the nodes vo,vi, . . . ,Vk are pairwise 
different. 
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But in this case the sequence of nodes 

{Vi+l, ■ ■ ■ ,Vk,Vo,Vl, . . .,Vi^l) 

is a path in N* containing at least one tree arc and connecting two parents of a hybrid 
node of N, which contradicts the strong time consistency condition. This shows that N* 
cannot contain any minimal cycle containing some tree arc of N. 

(ii) ^=^(iii) If N* does not have any cycle containing some tree arc of N, then the 
mapping 

that sends each v (zV to the maximum number of tree arcs in a path from the root r to u 
in N* is well defined, and it clearly satisfies conditions (a) to (c) in the statement. 

(iii) ^>(i) Assume that a mapping r as described in (iii) exists. Then, if (vq, . . . ,Vk) 
is a path in N* containing some tree arc of A^, we have that r(fo) < T{vk), and then vq 
and Vk cannot be the parents of a hybrid node u, because the latter would imply that 
r{vo) = t{u) = T{vk). □ □ 

In reconstructible phylogenetic networks the previous conditions are relaxed: tree nodes 
can have any out-degree greater than 1; hybrid nodes can have any in-degree greater than 
1 and any out-degree greater than (in particular, auto-polyploid hybrid nodes are forbid- 
den); a hybrid node can have hybrid children; and the strong time consistency need not 
hold any longer. 

Now, two nodes u,v of a (model or reconstructible) phylogenetic network are said to be 
convergent when they satisfy the following condition: 

For every leaf s and for every k ^ 0, there exists a path u s containing k 
hybrid nodes if and only if there exists a path v-^s containing k hybrid nodes. 

A phylogenetic network is said to be of class I if it does not contain any pair of convergent 
nodes. 

Then, it is claimed in |1H Thm. 4] that 9ab satisfies the separation property on this new 
class I of phylogenetic networks. It is again incorrect, because there still exist pairs of non- 
isomorphic class I model phylogenetic networks with AB-weighted error rate 0. Consider 
for instance the model phylogenetic networks A3 and A4 labeled in {1, . . . , 13} depicted 
in Fig. m in the Appendix, which are suitable modifications of those given in Fig. O to 
cope with the new restrictions on model phylogenetic networks (much simpler examples 
exist involving reconstructible networks containing, for instance, out-degree 3 tree nodes: 
consider the networks Ng and Aio shown in Fig. [Hj) . 

These networks have no pair of convergent nodes (as it can be checked in Table [2]) 
and they are clearly non-isomorphic. Now, Table [2] displays the tripartitions of these 
networks induced by their arcs and, again, a simple inspection of this table shows that 

eAB{N3)=eAB{NA). 

6 The third version: introducing the reticulation scenarios 

In the Technical Reports |12pi6j the authors present a third version of the tripartition 
metric, now including a substantial change in the definition of the metric itself. 

The definition of phylogenetic network in these papers is that of model phylogenetic 
network in the previous section with a small modification in the time compatibility con- 
dition. More specifically, a phylogenetic network on a set S of labels is a rooted DAG N 
labeled in S satisfying the following conditions: 
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(6.1) The root and all internal tree nodes have out-degree 2. All hybrid nodes have out- 
degree 1, and they can only have in-degree 2 or (in [12]) 1. 

(6.2) The child of a hybrid node is always a tree node. 

(6.3) Time consistency: Let x, y be any two nodes for which there exists a sequence of 
paths {Pq, Pi , . . . , Pfc) in such that 

• X is the origin of Pq and y is the end of P^ ', 

• each path contains some tree arc; 

• for every i = 0, . . . ,k — 1, the end of Pj and the origin of Pj+i are the parents of 
a hybrid node. 

Then, x and y cannot have a hybrid child in common. 

This time consistency condition (6.3) is actually weaker than the strong time consistency 
(5.3). For instance, Fig.[T]shows two situations where condition (6.3) allows the nodes u and 
V to hybridize, while condition (5.3) forbids it. Notice nevertheless that, under condition 
(6.2), time consistency (6.3) becomes equivalent to strong time consistency (5.3) if we simply 
ask at least one of the paths Pi (instead of all of them) to contain some tree arc. 




© © 

Figure 1: Time compatibility allows the nodes u and v in these graphs to hybridize, while 
strong time compatibility does not. 

In these papers the authors do not consider any class I of phylogenetic networks, but 
instead they refine the error rate by adding some extra information to tripartitions induced 
by network arcs. Namely, they define the reticulation scenario RS{v) of a hybrid node v 
with parents ui,U2 as the set of clusters of its parents: 

RS{v) = {C{ui),C{u2)}. 
Then, the data the author^ consider on arcs are: 

• If e is a tree arc, then "^^^\n) = 6^^j^{e); 

• if e is a network arc with head V, then = (0W(e), P5(v)). 

We shall call \I'(^)(e) the enriched (AB -weighted) tripartition of N associated to e. Notice 
that ^'(■^^(e) still depends always only on e's head. 

Let us denote by ^(N) the set of all these enriched tripartitions: 

^(A) = {^(^)(e) \ eeE}. 

''Actually, the tripartitions they use are AB- weighted in \12^ and unweighted in [I6j. For the sake of 
generality, in this section we shall use 8ab- 
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Then, in the papers quoted above, these sets ^'(A^) are used to define a metric mijr by means 
of a formula similar to that of my recalled in Section [3l 

It is claimed in [12^ §5.4] and [16^ §5] that ^ satisfies the separation property on the 
class of all phylogenetic networks, in the sense that if ^'(A'^i) = then A'^i = N2, for 

every phylogenetic networks A^i and A''2 labeled in the same set S. And again, it is untrue, 
as there exist pairs of non-isomorphic phylogenetic networks with the same sets of enriched 
tripartitions. For instance, the phylogenetic networks and labeled in {1, . . . , 13}, 
used already in the previous section and depicted in Fig. [H They have the same hybrid 
nodes, and we have already seen in Table [2] that they have the same sets of tripartitions 
induced by tree arcs as well as the same sets of tripartitions induced by network arcs. 
Table [3] shows that their hybrid nodes have the same reticulation scenarios. From these 
two tables, one can easily check that ^(N^) = ^{N/j^). 

7 Nakhleh's thesis: introducing the tree-sibhng condition 

In his PhD Thesis [T3], Nakhleh uses the enriched tripartitions ^ (actually, he uses un- 
weighted tripartitions, but for the sake of generality we shall still use them AB-weighted) , 
but he restricts the subclass of phylogenetic networks where ^' is stated to satisfy the 
separation property. 

The definition of model phylogenetic network in this work is that of phylogenetic network 
given in the last section (conditions (6.1), (6.2) and (6.3)), while in reconstructible networks 
these conditions are relaxed as in Section [5j 

Then he defines a phylogenetic network to be of class I when every hybrid node has 
at least one sibling that is a tree node. We shall say henceforth that such a phylogenetic 
network satisfies the tree-sibling condition, or simply that it is tree-sibling, to distinguish 
these networks from previous class I networks defined through the absence of convergent 
pairs: notice that, for instance, phylogenetic networks and A^4 in Fig. have no pair 
of convergent nodes, but they are not tree-sibling, while networks and A'^g in Fig. [5] 
in the Appendix are tree-sibling, but have pairs of convergent nodes (see Table H]) . The 
phylogenetic networks used in [HlllO] (obtained by adding network arcs to a phylogenetic 
tree by repeating the following procedure: choose pairs of arcs {ui,vi) and {u2,V2) in the 
tree; split the first into {ui,wi) and {wi,vi), with wi a new (tree) node; split the second 
one into {u2, W2) and (^2,^2)1 with W2 a new (hybrid) node; finally, add a new arc (101,102)) 
are tree-sibling. 

Nakhleh claims [15, Thm. 4 in Ch. 6] that ^ satisfies the separation property on the 
subclass of all tree-sibling phylogenetic networks. It is false, as there exist pairs of non- 
isomorphic tree-sibling model phylogenetic networks with the same sets of enriched tripar- 
titions Consider for instance the phylogenetic networks and Nq labeled in {1, . . . , 10} 
depicted in Fig. [5] in the Appendix. They are model phylogenetic networks (they even 
satisfy the strong time consistency condition (5.3) instead of the time consistency condition 
(6.3)), they satisfy the tree-sibling condition, and they are clearly non-isomorphic. 

Table H] displays the tripartitions of these networks induced by their arcs, and Table [5] 
gives the reticulation scenarios of their hybrid nodes (which are the same in both networks). 
From these two tables, one can easily check that ^(N^,) = ^(Nq). 
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8 Tripartitions do not distinguish distinguishable networks 



In the final paper of this series, Moret, Nakhleh, Warnow et al assert that their triparti- 
tions can be used to distinguish networks up to a certain notion of reduction that we recall 
below, from where they deduce that 6 satisfies the separation property on a very restricted 
subclass of phylogenetic networks. 

The notion of model phylogenetic network is this paper is exactly that of the last two 
sections. As far as reconstructible phylogenetic networks goes, they do not relax them as 
much as in previous papers. More specifically, a reconstructible phylogenetic network on a 
set S of labels is defined as a rooted DAG labeled in S satisfying the following conditions: 

(8.1) The root and all internal tree nodes can have any out-degree greater than 1. All 
hybrid nodes have out-degree 1, but they can have any in-degree greater than 1. 

(8.2) The child of a hybrid node is always a tree node. 

(8.3) Time consistency property (6.3). 

A subset U of internal nodes of V is said to be convergent when it has at least two 
elements, and all nodes in it have exactly the same cluster (contrary to previous versions, 
no condition on the number of hybrid nodes in the paths to descendant leaves is required). 
The removal of convergent sets is the basis of the following reduction procedure: 

(0) Replace every clade by a new "symbolic leaf" labeled with the names of all leaves in 
it. 

(1) For every maximal convergent set U, remove all nodes in paths from nodes in U to 
(symbolic) leaves, including the node in U but keeping the leaf. For every node x 
that is the tail of an arc whose head v has been removed, and for every leaf s in the 
cluster of v, add a new arc (x, s). 

(The resulting network contains no convergent set of nodes, because this step does 
not change the clusters of the surviving nodes.) 

(2) Append to every symbolic leaf representing a clade the corresponding clade, with an 
arc from the symbolic leaf to the root of the clade. 

(3) Replace every path of length greater than 1 with all its intermediate nodes of in- and 
out-degree equal to 1 by a single arc from its origin to its end. 

(In particular, if a symbolic leaf turns out to have only one parent, then it is removed 
and the root of the corresponding clade is appended to its first ancestor with out- 
degree different from 1.) 

The output of this procedure applied to a reconstructible phylogenetic network N is 
a DAG R{N) labeled in S. The network R{N) is called the reduced version of A'^. Two 
networks A^i and N2 are said to be indistinguishable when they have isomorphic reduced 
versions, that is, when R{Ni) = R{N2). 

Since every hybrid node in and its only child form a convergent set, they are removed 
in step (1) together with all their descendants until the clades' symbolic leaves. On the 
other hand, in R{N) the symbolic leaves may have more than one parent, and then they 
are the only possible hybrid nodes in R{N). So, in particular, no hybrid node in R{N) is a 
descendant of another hybrid node. Moreover, since all convergent sets and all nodes with 
in- and out-degree 1 in are removed, the only possible convergent sets in R{N) consist 
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of a hybrid node and its only child (that is, a symbolic leaf with more than one parent and 
the root of the corresponding clade). 

We want to point out that the reduced version of a reconstructible phylogenetic net- 
work need not be a reconstructible phylogenetic network. Consider for instance the simple 
network N in Fig. [2] below. The nodes a,b,r form a convergent set, and therefore they 
are removed in the reduction process. Then, the reduced version of is a non connected 
DAG consisting of four arcs with heads the leaves of N. As another example, consider the 
reconstructible phylogenetic network A'g in Fig. [6] in the Appendix: its reduced version, 
which is shown in Fig. [71 does not satisfy the time consistency property. This shows that 
reduced versions need not be time consistent. 




Figure 2: A model phylogenetic network N (left) and its reduced version (right) 

It is claimed in [14| Lem. 1] that if 0{Ni) = 9{N2), then A^i and sue indistinguishable 
(the converse does not hold in general, because the reduction process may remove parts 
with different topologies that yield differences in the sets of tripartitions). This is not 
true. Consider for instance the model phylogenetic networks and depicted in Fig. [6] 
in the Appendix. Table [6] displays the tripartitions of these networks induced by their 
arcs, showing that the sets of tripartitions are the same (we give actually the AS-weighted 
tripartitions, just to show that their claim is still false if we replace 9 by 9ab)- Fig. [7] shows 
the reduced versions of these networks, and they are clearly non isomorphic. 

The authors also claim [lA^ Thm. 3] that 9 satisfies the separation property on the 
subclass of reduced reconstructible phylogenetic networks (that is, of reconstructible phy- 
logenetic networks that remain untouched under the reduction procedure). This is also 
wrong. Consider for instance the networks Ng and A^io depicted in Fig. [8] in the Appendix. 
They are reconstructible phylogenetic networks, and they are reduced because the only 
convergent sets they contain consist of a hybrid node and a leaf that is its only child, and 
therefore the application of the reduction procedure leaves them untouched. Table [7] shows 
that these two networks have the same sets of Ai?-weighted tripartitions, but they are 
clearly non-isomorphic. 

9 Tree-child phylogenetic networks 

We have seen in previous sections that neither the lack of convergent pairs of nodes, nor 
the tree-sibling condition or even the property of being reduced, in all cases combined with 
the strong time consistency condition, guarantee the separation property for 9ab- In this 
section we introduce a stronger condition that, by itself, does not guarantee this separation 
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property either, but that combined with condition (4. 3. a) makes de satisfy the separation 
property. Even more, it makes bipartitions vr satisfy the separation property. 

We shall say that a DAG satisfies the tree- child condition, or simply that it is tree- child, 
when every node other than a leaf has at least one tree child. A tree-child phylogenetic 
network is a rooted tree-child DAG with no tree node of out-degree 1 and all hybrid nodes 
of out-degree exactly 1 (and any in-degree greater than 1). So, tree-child phylogenetic 
networks can be understood as models of reticulated evolution where: 

• The tree nodes represent species. 

• The hybrid nodes represent recombination or lateral gene transfer events that yield 
the species corresponding to their single tree child. 

• Every species other that the extant ones, represented by the leaves, have some de- 
scendant through mutation. 

The (even enriched) tripartitions introduced so far do not satisfy the separation property 
on the subclass of all tree-child phylogenetic networks. Consider for instance the networks 
A'^ii and N12 depicted in Fig. [9] in the Appendix. Table [8] provides the ^I?-weighted tri- 
partitions and reticulation scenarios of their arcs, showing that these networks cannot be 
distinguished using this information. 

But if we add to the tree-child condition the weakest form of time consistency, namely 
condition (4. 3. a) (two parents of a hybrid node cannot be connected by a path), then 9 sat- 
isfies the separation property on the resulting subclass of phylogenetic networks. Actually, 
it turns out that the bare sets of clusters of nodes (without distinguishing between strict 
and non-strict descendants, and without taking into account the numbers of hybrid nodes in 
paths to leaves) are enough to characterize these phylogenetic networks up to isomorphism 
(Thm. [8j), which entails that vr satisfies the separation property, just as in phylogenetic 
trees. 

To prove these facts, we need to establish some preliminary definitions and results. 

We shall denote henceforth by C^^^ (f ) the cluster of a node f in a DAG A'^ labeled in S, 
to emphasize the network. For every DAG N = (V,E), let C{N) denote the set of clusters 
of its nodes: 

C{N) = {C'^^\v) \veV}. 

A tree path in a DAG is a path consisting entirely of tree arcs. A node f is a tree 
descendant of a node u when there exists a tree path from u lo v. 

Lemma 2. Every node v in a tree-child phylogenetic network has some tree descendant 
leaf. 

Proof. If V is not already a leaf, we can construct a tree path starting in v by successively 
taking tree children, and this path will end in a leaf that will be a tree descendant of 
V. U U 

Lemma 3. Let u-^v be a tree path in a DAG N. Then, for every other path w~^v ending 
in V, either it contains u-^v or u-^v contains w~^v. 

Proof. Let {u,vi, . . . ,Vk-i,v) be the tree path u v in the statement: to simplify the 
notations, we call vq = u and = v. Let now w-^v he any other path ending in v and let 
Vj be the first node in the path u~^v such that {vj, . . . ,Vk) is contained in w-^v. If j ^ 0, 
then Vj has only one parent, and therefore it must happen that either Vj = vu 01 fj's parent 
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in the path w-^v is also its parent in the path u-^v, and in particular it also belongs to 
this path, which contradicts the assumption on Vj. 

This proves that either vj = w, in which case the path u-^v contains the path w-^v, 
or Vj = u, in which case the path w~^v contains the path u-^v. □ □ 

Corollary 4. If v is a tree descendant of u in a DAG N , then v G A[u) and the path u-^v 
is unique. 

Proof. If is a tree descendant of u, then there exists a tree path u v. Then, by the 
previous lemma, every path from a root r to t; must contain this path u-^v, which shows 
that u is a strict descendant of u, and every other path from uio v must contain (and hence 
be equal to) this path u-^v. □ □ 

Lemma 5. Let N = {V,E) be a tree- child phylogenetic network satisfying condition (4-3. a). 
For every nodes u,v £ V , the following conditions are equivalent: 

ft) cW(u) = C7W(?;) 

(a) u = V or {u,v} are a hybrid node and its only child. 

Proof. The implication (ii)^=^(i) is obvious. As far as the implication (i)^^(ii) goes, 
assume that C^^\u) = C^^\v) and that u ^ v. If s is a tree descendant leaf of u, then 
s E C^^\v), and hence, by LemmaO either u belongs to the path v-^s or v belongs to the 
path u~^s. Therefore, u and v are connected by a path. To fix ideas, assume that there 
exists a path u~^v. Then we must distinguish three cases: 

• If u is a tree node and it has some tree child w outside the path u-^ v, then every 
tree descendant leaf s of if is a tree descendant leaf of u and hence, since 
C^^\v), a descendant of v. By the uniqueness of the path u-^ s (Corollary H]), the 
tree path s must be equal to the concatenation of the path u-^v and the path 
v-^s, but these paths are different because their first arcs are different. This yields 
a contradiction. 

• If n is a tree node and all its children outside the path u-^v are hybrid, take one such 
hybrid child w. Let s be any tree descendant leaf of w. Then s € C'^^\u) = C^^\v) 
and therefore there exists a path v s. Now, by Corollary IU the path w s is 
unique, and therefore the path n~^s is also unique. Indeed, given any path n~^s, by 
Lemma [3] and since u cannot be a descendant of w, the path w~^s must be contained 
in this path and since no other parent of w is a descendant of u by (4. 3. a), the 
only possibility is that the first arc of this path is {u, w) and then the rest of the path 
is the tree path w-^s. 

This means that the path obtained by concatenating the paths u-^v and v~^s must 
be equal to the path u-^ s through w, but, again, these paths are different because 
their first arcs are different. This yields again a contradiction. 

• If n is a hybrid node and u' is its unique tree child, then C^^\u') = C^^\u) = C^^\v) 
and u' must be the first node after u in the path u-^ v. This yields a path u' v 
with C^^\u') = C^^\v) and u' a tree node. The last two points have shown that 
the assumption that u' ^ v leads to a contradiction, while u' = v is clearly possible. 

In summary, the only situation that does not lead to a contradiction is when n is a hybrid 
node and v its only child. □ □ 
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Lemma 6. Let N = iV^E) he a tree- child phylogenetic network satisfying condition (4. 3. a). 
For every nodes u,v G V such that C^'^^u) 7^ C^^\v), the following two conditions are 
equivalent: 

(i) There is a non-trivial path u-^v. 
(11) C^^\v) C C7W(n). 

Proof. The implication (i)^=^(ii) is straightforward: if t> is a descendant of n, then C*^^-* {v) C 
C^^\u), and by assumption these clusters are different. 

As far as the converse implication goes, assume that C^^\v) C CW(u), and let s be 
a tree descendant leaf of v. Then s G C^^\u) and Lemma [3] entails that u and v are 
connected by a path. Since clusters decrease with paths, this path must he u-^v. □ □ 

Now, given a tree-child phylogenetic network N = (V,E) satisfying condition (4. 3. a), 
its contracted version is the DAG N' = (V, E') obtained from N by contracting into one 
node each pair of nodes consisting of a hybrid node and its only child: more specifically, for 
every hybrid node u, if u is its only child and ui, . . . , the children of n, then we remove 
this node u and the arcs incident to it, and we replace the latter by new arcs (u, ui),. . . , 
(u, Mfc). In this way, we understand V as a subset of F, consisting of all nodes of except 
the children of hybrid nodes. 

It is clear that C'^'^\v) = C^^'\v) for every node v £ V , and hence that C{N) = C{N'). 
Moreover, C^^ \u) 7^ C^^ \v) if u ^ v, because each pair of nodes in A^ with the same 
cluster has been contracted to a single node in A^'. In particular, the mapping 

cm : V' ^ C{N') 

is bijective. 

On the other hand, for every u,v €z V', there exists a path u-^ v in N if and only if 
there exists a path u-^v in N'. Therefore, from Lemma Owe deduce that there exists a 
path u-^v in N' if and only if C^^'\v) C C^'^'\u): that is, the inclusion of clusters in A^' 
captures exactly the path ordering in A^', which is the restriction to V of the path ordering 
on A^. 

Lemma 7. Let N be a tree- child phylogenetic network satisfying condition (4 -3. a), and let 
N' = {V\ E') he its contracted version. For every u € V' , let 

Mu = {w(^V'\ C(^')(u;) C C(^')(u)}. 

Then, the maximal elements of with respect to the path ordering on N' are exactly the 
children of u in N' . 

Proof. If n is a leaf, then C^^'\u) = {u} and Mu = 0, and the thesis of the statement 
clearly holds. So, assume that u is not a leaf. Then, every descendant of u is in M„ and 
therefore is non-empty. 

Since M„ is finite, it has maximal elements. Let v be any such a maximal element, 
bmce C^^'\v) C C(^')(n), there exists a non trivial path u-^v in N' . If this path passes 
through some other node w, then C(^')(w) C C7(^')(u;) c 

(u), against the assumption 
that V is maximal in M„. Therefore, the path u-^v has length 1 and u is a child of u. 

Conversely, let v he a child of u. If it is not maximal in Mu, then there exists a path 
u V different from the arc {u,v). Let w he the parent of v in this path. Then v has 
(at least) two parents, u and w, and there is a path u-^w in N'. When we translate this 
situation to A^, we have essentially four possibilities: 
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• V is a, hybrid node with parents u, w and there exists a path u-^w in N connecting 
them. 

• v is a hybrid node with parents u and the tree child w of the hybrid node w. Then 
the path u w in N' corresponds to a path u-^ w in N that, followed by the arc 
{w,w), yields a path u-^w. 

• V is a hybrid node with parents w and the tree child u of the hybrid node u. Then 
the path u-^w in N' corresponds to a path u-^w in N , and since u is the only child 
of u, the latter contains a path u ^ w. 

• V is a hybrid node with parents the tree child w of the hybrid node w and the tree 
child u of the hybrid node u. Then the path u-^w in N' corresponds to a path u-^w 
in N. Arguing as in the last two points (simultaneously), we deduce that there exists 
a path u~^w in N . 

In all four cases, we obtain a hybrid node of N and a path connecting two parents of it, 
which contradicts (4. 3. a). This implies that v must be maximal in M„. □ □ 

Theorem 8. For every tree-child phylogenetic networks Ni and N2 satisfying the weak time 
consistency property (4-3. a), 

Ni ^ N2 if and only ifC{Ni) = C{N2). 

Proof. Assume that C(iVi) = C{N2), and let N[ = {V{,E[) and N!^ = {V^,E'^) be the 
contracted versions of Ni and N2. Then C{N[) = C{N^). 

Consider the mapping /' : Vl — > V2 obtained as the composition 

y/^_4 C{N[) = C{N!,)''^ Vi; 

that is, /' sends each node v € Vl to the unique node f'{v) € V^' such that C^^'^\v) = 
C^^^\f'{v)). Since and 

are bijective, /' is bijective. Furthermore, v is maximal 
in Mu if and only if f'{v) is maximal in Mj/(^^^ (because these sets are defined through the 
corresponding clusters, and the path ordering in N[ and N2 corresponds to the inclusion of 
clusters). Therefore, {u,v) G E[ if and only if {f'{u),f'{v)) G E!^. So, / is an isomorphism 
of DAGs. 

Finally, u is a leaf of a DAG if and only if its cluster is the singleton {u}. This entails 
that /' sends leaves to leaves and preserves their labels. Therefore, /' is an isomorphism of 
DAGs labeled in S. 

Now, the hybrid nodes in each are the nodes that have in-degree greater than 1 in 
the corresponding A^^', and Ni is obtained from N- by adding a single child u to each hybrid 
node u and replacing all arcs with tail u by arcs with tail u (and the same heads). This 
implies that the mapping 

f:Vi^V2 

that restricts to /' on V( and that sends each node u in \ V-[ to the corresponding 
f'{u) (that is, to the only child of the image of the parent of u) is bijective and preserves 
and reflects the arcs and preserves the leaves' labels. Therefore, it is an isomorphism of 
phylogenetic networks. □ □ 

Corollary 9. The bipartition w, and hence also 6, Ob, Oab, and ^, satisfy the separation 
property on the subclass of all tree-child phylogenetic networks where no pair of parents of 
a hybrid node is connected by a path. 
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Corollary 10. For every T = tt,0,9b,0ab,^ , the mapping 

dr{Ni,N2) = ^(|T(r) \ T(r')| + |T(r') \ T(r)i) 

defines a distance on the subclass of all tree-child phylogenetic networks where no pair of 
parents of a hybrid node is connected by a path. 

10 Conclusion 

In a series of technical reports and papers culminating in [14J, Moret, Nakhleh, Warnow 
and collaborators have introduced an error metric for phylogenetic networks, with the main 
goal of comparing reconstructed networks with true ones and to assess in this way the accu- 
racy of phylogenetic network reconstruction algorithms. In this paper we have shown that 
none of their approaches is free from false equalities: that is, for every one of the metrics 
they introduce, there turn out to exist pairs of phylogenetic networks in the subclass where 
the metric is defined, that are non-isomorphic (or, in the case of |14j . have non-isomorphic 
reduced versions) but cannot be distinguished through the metric. The reason for this lack 
of discriminating power is that non-isomorphic networks in the subclasses under consider- 
ation may have the same sets of tripartitions, which are the networks' representations that 
are compared by this metric. Among these subclasses of networks where the error metric 
fails, we want to stress the tree-sibling, strongly time consistent phylogenetic networks (see 
Section [7]), for which several reconstruction algorithms were recently proposed [OtllOj. 

We have also shown a subclass of phylogenetic networks, the tree-child, weakly time 
consistent phylogenetic networks, where tripartitions, and even bipartitions in the sense 
of Bourque-Robinson-Foulds, single out its members, and therefore they can be used to 
define a true metric. Tree-child phylogenetic networks can be seen as models of reticulate 
evolution histories where every species other than the extant ones have some descendant 
through mutation. 

Several questions and problems arise as a consequence of our work that are in our current 
research agenda. On the one hand, what is the discriminating power of tripartitions? Is 
there a well-defined class of phylogenetic networks where equality of sets, or multisets (sets 
with repetitions), of tripartitions implies isomorphism? And, what does it really mean, 
from the topological point of view, to have the same (multi)sets of tripartitions? 

On the other hand, it is still necessary to define true distances generalizing the bipar- 
tition distance, on more general subclasses of phylogenetic networks than those tree-child, 
weakly time consistent. We have recently defined one such metric (not based on triparti- 
tions) on the class of all tree-child phylogenetic networks [S], but, in the light of [ElllOj, we 
consider a more relevant target the class of all tree-sibling networks. 
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Appendix 



In this Appendix we collect all depictions of phylogenetic networks and their tripartitions. 

In graphical representations of phylogenetic networks, and of DAGs in general, hybrid 
nodes are represented by squares and tree nodes by circles. In those cases where the strong 
time consistency condition is considered, nodes are labelled with its corresponding r (see 
Proposition [T|) as subscript, to ease the verification of this condition. 

In the tables presenting tripartitions we shall make use for simplicity of the following 
conventions: we only provide the sets A and B, as can be trivially deduced from them; 
the labels' weights are shown as subscripts, the lack of subscript meaning weight 0; and 
since the tripartition induced by an arc only depends on its head, we identify the arcs by 
means of their heads. 




Figure 3: The networks A^i (left) and (right) 
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Table 1: Tripartitions of the networks in Fig. [3] 
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Table 2: Tripartitions of the networks in Fig. H] 
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Table 3: Reticulation scenarios of the hybrid nodes of the networks in Fig. H] 
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Figure 4: The networks (up) and (down) 
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Table 4: Tripartitions of the networks in Fig. [5] 
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Table 5: Reticulation scenarios of the hybrid nodes of the networks in Fig. [5] 
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Figure 7: The reduced versions R{Ni) (left) and R{Ns) (right) of the networks given in 
Fig. El 
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Table 6: Tripartitions of the networks in Fig. [H] 




Figure 8: The networks A'^g (left) and A^io (right) 
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Table 7: Tripartitions of the networks in Fig. [8] 




Figure 9: The networks A'"!! (left) and A'"! 2 (right) are tree-child but cannot be distinguished 
by means of their tripartitions 
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Table 8: Tripartitions and reticulation scenarios of the tree-child phylogenetic networks in 
Fig. El 
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